首页> 外文OA文献 >A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena
【2h】

A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena

机译:统计机器翻译中词语重新排序的调查:   计算模型与语言现象

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Word reordering is one of the most difficult aspects of statistical machinetranslation (SMT), and an important factor of its quality and efficiency.Despite the vast amount of research published to date, the interest of thecommunity in this problem has not decreased, and no single method appears to bestrongly dominant across language pairs. Instead, the choice of the optimalapproach for a new translation task still seems to be mostly driven byempirical trials. To orientate the reader in this vast and complex researcharea, we present a comprehensive survey of word reordering viewed as astatistical modeling challenge and as a natural language phenomenon. The surveydescribes in detail how word reordering is modeled within differentstring-based and tree-based SMT frameworks and as a stand-alone task, includingsystematic overviews of the literature in advanced reordering modeling. We thenquestion why some approaches are more successful than others in differentlanguage pairs. We argue that, besides measuring the amount of reordering, itis important to understand which kinds of reordering occur in a given languagepair. To this end, we conduct a qualitative analysis of word reorderingphenomena in a diverse sample of language pairs, based on a large collection oflinguistic knowledge. Empirical results in the SMT literature are shown tosupport the hypothesis that a few linguistic facts can be very useful toanticipate the reordering characteristics of a language pair and to select theSMT framework that best suits them.
机译:单词重新排序是统计机器翻译(SMT)的最困难的方面之一,并且是其质量和效率的重要因素。尽管迄今为止已发表了大量研究,但社区对此问题的兴趣并没有减少,而且没有一个问题该方法似乎在所有语言对中都占主导地位。取而代之的是,为新的翻译任务选择最佳方法似乎仍然主要是凭经验进行的。为了使读者适应这个庞大而复杂的研究领域,我们对单词重排进行了全面的调查,这被视为统计建模挑战和自然语言现象。该调查详细描述了如何在不同的基于字符串和基于树的SMT框架中以及作为独立任务来建模单词重排,包括高级重排建模中文献的系统概述。然后我们质疑为什么在不同语言对中某些方法比其他方法更成功。我们认为,除了测量重新排序的数量外,了解在给定语言对中发生哪种重新排序也很重要。为此,我们基于大量的语言知识,对各种语言对样本中的单词重排现象进行了定性分析。 SMT文献中的经验结果表明,支持以下假设:一些语言事实对于预测语言对的重排特征以及选择最适合它们的SMT框架非常有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号